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Abstract we focus on a recently new intelligent optimization algorithm called 
discrete state transition algorithm, for solving integer optimization problems. 
Firstly, we summarize some key elements for discrete state transition algo- 
rithm to guide its well development. Several intelligent operators are designed 
for local exploitation and global exploration. Then, a dynamic adjustment 
strategy "risk and restore in probability" is proposed to gain global conver- 
gence with high probability. Finally, numerical experiments are carried out to 
test the effectiveness of the proposed algorithm, and they show that the similar 
intelligent operators can be applied to ranging from traveling salesman prob- 
lem, boolean integer programming, to discrete value selection problem, which 
indicates the robustness and adaptability of the proposed intelligent elements. 

Keywords state transition algorithm • integer programming ■ traveling 
salesman problem • maximum cut problem • discrete value selection 



Xiaojun Zhou 

School of Information Science and Engineering, Central South University, Changsha 410083, 
China. 

School of Science, Information Technology and Engineering, University of Ballarat, Victoria 
3353, Australia. 

David Yang Gao 

School of Science, Information Technology and Engineering, University of Ballarat, Victoria 
3353, Australia. 

Chunhua Yang 

School of Information Science and Engineering, Central South University, Changsha 410083, 
China. 



2 



Xiaojun Zhou et al. 



1 Introduction 

In this paper, we consider the following unconstrained integer optimization 
problem 

min/(a;), (1) 

where, x — {x\, • • • , x n ), elc Z m , i = 1, • • • ,n. 

Generally speaking, the above optimization problem is NP-hard, which can 
not be solved in polynomial time (by enumeration, there exist m n choices). A 
direct method is to adopt the so called "divide-and-conquer" strategy, which 
separates the optimization problem into several subproblems and then solve 
these subproblems step by step. Branch and bound (B&B), branch and cut 
(B&C), and branch and price (B&P) belong to this kind; however, these meth- 
ods are essentially in exponential time. An indirect method is to relax the op- 
timization problem by loosening its integrality constraints to continuity and 
then solve the continuous relaxation problem or its Lagrangian dual problem, 
including LP-based relaxation, SDP-based relaxation, Lagrangian relaxation, 
etc. Nevertheless, when rounding off the relaxation solution, there may cause 
some infeasibility or can only get approximate solution, and when using La- 
grangian dual, there may exist duality gap between the primal and the dual 
problem . 

On the other hand, some stochastic algorithms, such as genetic algorithm 
(GA) [TlfTo]. simulated annealing (SA) [6lfT7]. ant colony optimization (ACO) 
[2llllj. are also widely used for combinatorial optimization problems, which 
aim to obtain "good solutions" in reasonable time. In terms of the concepts 
of state and state transition, a new heuristic search algorithm called state 
transition algorithm (STA) has been proposed recently, which exhibits fan- 
tastic results in continuous function optimization 18,19,20. In three 
intelligent operators (geometrical operators) named swap, shift and symmetry 
have been designed for discrete STA to solve the traveling salesman prob- 
lem (TSP), and it shows that the discrete STA outperforms its competitors 
in respect to both time complexity and search ability. To better develop dis- 
crete STA for medium-size or large-size discrete optimization problems, in the 
study, we firstly build the framework of discrete state transition algorithm and 
propose the five key elements for discrete STA, of which, the representation 
of a decision variable in discrete STA, the local and global operators and the 
dynamic adjustment strategy are mainly studied. Four geometrical operators 
named swap, shift, symmetry and substitute are designed, which are intelli- 
gent due to their adaptability in various types of integer programming. The 
mixed strategies of "greedy criterion" and "risk and restore in probability" are 
proposed, in which, "greedy criterion" and "restore in probability" are used 
to guarantee good convergence performance, and "risk a bad solution in prob- 
ability" aims to escape from local optimality. Some applications ranging from 
traveling salesman problem to discrete value selection problem are studied. 
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2 The framework of discrete state transition algorithm 

If a solution to a specific optimization problem is described as a state, then 
the transformation to update the solution becomes a state transition. Without 
loss of generality, the unified form of discrete state transition algorithm can 
be described as 

f x k+1 = A k (x k ) © B k (u k ) 
[Vk+i = f(xk+i) 

where, x k e Z n stands for a current state, corresponding to a solution of 
a specific optimization problem; u k is a function of x k and historical states; 
A k {-), B k (-) are transformation operators, which are usually state transition 
matrixes; is a operation, which is admissible to operate on two states; / is 
the cost function or evaluation function. 

The first clement we want to mention is the representation of a solution 
to a discrete optimization problem. In discrete STA, we choose special repre- 
sentations, which can be easily manipulated by the intelligent operators. Why 
we call the operators "intelligent" due to its geometrical property (swap, shift, 
symmetry and substitute) , and a intelligent operator has the same geometrical 
function for different representations. The big advantage of such representa- 
tions and operators is that, after each state transformation, the newly created 
state is always feasible, avoiding the trouble in dealing with constraints. 

The second important thing in state transition algorithm is that, when 
a transformation operator is exerted on current state, the next state is not 
deterministic, that is to say, there are possibly different choices for the next 
state. It is not difficult to imagine that all possible choices will constitute a set, 
or a neighborhood. Then we execute several times of transformation (search 
enforcement SE) on current state, to sampling in the neighborhood. Sampling 
is the first key element in state transition algorithm, which can reduce the 
search space and avoid enumeration. 

As a intelligent optimization algorithm, the discrete state transition algo- 
rithm also comprises the following key elements: 

(1) local exploitation and global exploration. In optimization algorithms, it 
is very significant to design good local and global operators. The local exploita- 
tion can guarantee high precision of a solution and convergent performance 
of a algorithm, and the global exploration can avoid getting trapped into lo- 
cal minima or prevent premature convergence. In discrete optimization, it is 
extremely difficult to define a "good" local optimal solution due to its de- 
pendence on a problem's structure, which leads to the same difficulty in the 
definition of local exploitation and global exploration. Anyway, in the discrete 
state transition algorithm, we define the slow change to current solution by a 
transformation as local exploitation, while the big change to current solution 
by a transformation as global exploration. 

(2) self learning and regular communication. State transition algorithm 
behaves in two forms, one is individual-based, the other is population-based, 
which is certainly a extended version. The individual-based state transition 
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algorithm focuses on self learning, in other words, with emphasis on the oper- 
ators' designing and dynamic adjustment (details given in the following). Un- 
doubtedly, communication among different states is a promising strategy for 
state transition algorithm, as indicated in [20 . Through communication, states 
can share information and cooperate with each other. However, how to commu- 
nicate and when to communicate are key issues. In continuous state transition 
algorithm, intermittent exchange strategy was proposed, which means that 
states communicate with each other at a certain frequency in a regular way. 

(3) dynamic adjustment. It is a potentially useful strategy for state tran- 
sition algorithm. In the iteration process of a algorithm, the fitness value can 
decrease sharply in the early stage, but it stagnates in the late stage, due to 
the static environment. As a result, some perturbation should be added to 
activate the environment. In fact, dynamic adjustment can be understood and 
implemented in various ways. For example, the alternative use of different lo- 
cal and global operators is dynamic adjustment to some extent. Then, we can 
change the search enforcement, vary the cost function, reduce the dimension, 
etc. Of course, "risk a bad solution in probability" is another dynamic adjust- 
ment, which is widely used in simulated annealing (SA). In SA, the Metropolis 
criterion [9] is used to accept a bad solution: 



where, AE = f(xk+i) — f(%k), is the Boltzmann probability factor, T 
is the temperature to regulate the process of annealing. In the early stage, 
temperature is high, and it has big probability to accept a bad solution, while 
in the late stage, temperature is low, and it has very small probability to 
accept a bad solution, which is the key point to guarantee the convergence. 
We can see that the Metropolis criterion has the ability to escape from local 
optimality, but on the other hand, it will miss some "good solutions" as well. 

In this study, we focus on the individual-based STA, and the main process 
of discrete STA is shown in the pseudocode as follows 
l: repeat 

2: [Best,fBest] ■(— swap(funfcn,Best,fBest,SE,n,m_a) 

3: [Best,fBest] <— shift(funfcn,Best,fBest,SE,n,m_b) 

4: [Best,fBest] ■(— symmetry(funfcn,Best,fBest,SE,n,m_c) 

5: [Best,fBest] «— substitute (funfcn, Best, fBest,SE, set, n,m_d) 

6: if fBcst < fBcst* then > greedy criterion 

7: Best* <r- Best 

8: ffiest* <- fBcst 

9: end if 

10: if rand < p\ then > restore in probability, only in DSTA 

11: Best «- Best* 

12: ffiest <- fBest* 

13: end if 

14: until the maximum number of iterations is met 



probability p = exp( 



-AE 



(3) 
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As for detailed explanations, swap function in above pseudocode is given 
as follows for example 
l: State <!— op_swap(Bcst,SE,n,m_a) 
2: [newBest,fGBcst] «— fitness(funfcn, State) 

3: if fGBcst < fBest then > greedy criterion 

4: Best newBcst 
5: fBest <- fGBest 
6: else 

7: if rand < p2 then > risk in probability, only in DSTA 

8: Best «— ncwBest 

9: fBest <- fGBest 

10: end if 
11: end if 

From the pseudocodes, we can find that in the simple STA, only "greedy 
criterion" is adopted to accept a new "Best*", while in dynamic STA, in the 
whole, "greedy criterion" is adopted to keep the incumbent "Best*", in the 
partial, a bad solution "Best" is accepted in each state transformation at a 
probability p 2 , and in the same while, the "Best*" is restored in the process at 
another probability pi . The "risk a bad solution in probability" strategy aims 
to escape from local optimal, while the "greedy criterion" and "restore the 
incumbent best solution in probability" are to guarantee a good convergence. 



3 Theoretical analysis of the discrete STA 

Firstly, we define the concept of convergence for discrete STA 

\f(x k )-f(x*)\ <e,Vfc> N, (4) 

while, x* is a global optimum of a discrete optimization problem, e is a small 
constant, and N is a natural number. If e > 0, we can say that the algorithm 
converges to a e — suboptimal solution; if e = 0, we can say that the algorithm 
converges to a global minimum. However, if x* is a local minimum solution, 
then we can say that the algorithm converges to a e — suboptimal solution 
and a local minimum for e > and e = 0, respectively. 
Theorem 1 the discrete STA can at least converge to a local minimum. 

Proof. Let suppose the maximum number of iterations (denoted by M) is 
big enough, considering that the "greedy criterion" is used to keep the incum- 
bent best, then there must exist a number N < M, when k > N, no update of 
better solution will happen. That is to say, f(xk) = /(xjv), V k > N, where xn 
is the solution in the TVth iteration. Denote xm as the local minimum solution 
x*, and then \f{x k ) - f(x*)\ = 0. 

Theorem 2 the discrete STA can converge to a global minimum in probabil- 
ity. 

Proof. Let suppose x* — (ai, • • • ,a„) is a global minimum solution, and 
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%n = ipx,--- ,b n ) is the Nth best solution. If xn — x*, according to the 
"greedy criterion", f(xk) = /(%), V fc > N, which means that it converges to 
x* . Otherwise, there must exist a transformation, either swap, shift, symme- 
try or substitute, such that xn+i x = (ai, • • • ,b n ), which means that after l\ 
iterations, b\ will be changed into a\. If /(xN+h) < J{xn), the xn+i x is kept 
as incumbent best for next iteration, else, xn+i x is also kept as incumbent 
best for next iteration in probability. Following the similar way, after at most 

h + ■ ■ ■ + Ui iterations, XN+h-\ \-i n wu 1 be (aj., • • • , a n ) in probability. 

From the proof of Theorem 2, we can find that dynamic STA has higher 
probability than the simple STA to gain the global optimum. Some applica- 
tions are given to describe the details in the following. 



4 Application for traveling salesman problem 

Suppose Af = {1, • • • ,n} is the set of cities, the traveling salesman problem 
can be described as: given a set of n cities and the distance dij for each pair 
of cities i and j, find a roundtrip of minimal total length visiting each city 
exactly once. Typically, the traveling salesman problem is usually modeled as 
the following two representations [T2] . 
(LP-TSP): 

n n 

mm J2 J2 

x n i=i j=l 
n 

S.t. X ij — G Af 

1=1 

n 

E xn = l,Vi e Af 
E E x v > !> 

its jes 
VS c Af, S C Af \ S 
x^ e {0, 1}, Vi, j e Af, (5) 
where, the decision variable x^ is defined by 

1, if city i is followed by city j (6a) 
0, otherwise (6b) 

(QP-TSP): 

n n n 

min E E E Xijd ik {x k (j +Vj +x k(J _ 1) ) 
x a i—i k=i j=i 

n 

s.t. E x v = 1, Vj e Af 

i=l 



^ -e {0,l},Vi,j eAf, (7) 
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here, the decision variable Xij is defined by 

{1 , if city i is in the jth position (8a) 
0, otherwise (8b) 

We can find that the model of linear programming (LP) based TSP is 
different from the model of quadratic programming (QP) based TSP in two 
aspects, one is the definition of the decision variable Xij, the other is that 
(LP-TSP) has one more constraint than (QP-TSP). Taking the difficulty of 
dealing with constraints into consideration, in discrete STA, we use another 
simple representation which can be easily manipulated by intelligent operators. 



4.1 Transformation operators for TSP 



As for a n-city traveling salesman problem, a permutation of {1, 2, ■ • • , n} is 
used to represent a solution to the problem in discrete STA. Based on the rep- 
resentation, three special transformation operators are proposed to illustrate 
local and global search. 
(1) swap transformation 



Xk+l 



A s k wap (m a )x k : 



(9) 



where, A 



swap 



is called swap transformation matrix, m a is a constant 



integer called swap factor to control the maximum number of positions to 
be exchanged, while the positions are random. If m a = 2, we call the swap 
operator local exploitation, and if m a > 3, the swap operator is regarded 
as global exploration in this case. Figure [T] gives the function of the swap 
transformation graphically when m a = 2. 




swap(2, 5) 
► 



4 N 



12 3 4 5 6 

Fig. 1 illustration of swap transformation for TSP 



2 



1 5 3 4 2 6 



(2) shift transformation 



Xk+l 



a shift/ \ 

A k (m b )x k , 



(10) 
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where, A 8 ^ 1 e M™ x " is called shift transformation matrix, m& is a constant 
integer called shift factor to control the maximum length of consecutive posi- 
tions to be shifted. By the way, the selected position to be shifted after and 
positions to be shifted are chosen randomly. Similarly, shift transformation 
is called local exploitation and global exploration when mj, = 1 and nib > 2 
respectively. To make it more clearly, if mj = 1, we set position 3 to be shifted 
after position 5, as described in Figure [2] 




123456 124536 

Fig. 2 illustration of shift transformation for TSP 



(3) symmetry transformation 

x k+ i = Al ym (m c )x k , (11) 

where, A s k ym £ K nx ™ is called symmetry transformation matrix, m c is a con- 
stant integer called symmetry factor to control the maximum length of subse- 
quent positions as center. By the way, the component before the subsequent 
positions and consecutive positions to be symmetrized are both created ran- 
domly. Considering that the symmetry transformation can make big change 
to current solution, it is intrinsically called global exploration. For instance, 
if m c = 0, let choose component 3, then the subsequent position or the cen- 
ter is {0}, the consecutive positions are {4,5}, and the function of symmetry 
transformation is given in Figure [3] 



4.2 TSP instances for test 

To evaluate the performance of the simple STA and the dynamic STA (DSTA), 
4 medium-size TSP instances are used for test. We set m a = 2, m& = 1, m c = 
0,md — l,pi = 0.1459,^2 = 0.0557 and the maximum number of iterations 
at 1500. Programs are run independently for 20 trails for each algorithms 
in MATLAB R2010b (version of 7.11.0.584) on Intel(R) Core(TM) i3-2310M 
CPU @2.10GHz under Window 7 environment, and for each run, it costs about 
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123456 154326 



Fig. 3 illustration of symmetry transformation for TSP 



0.5 minutes. Some statistics as well as the "error" are computed for compari- 
son, where, "error" is defined by 

best — optimum „. 

error = - x 100%, 

optimum 

where, best is the best result achieved by discrete STA, optimum is the incum- 
bent best result in TSPLIB [10 . Experimental results for these TSP instances 
are given in Table [TJ 



Table 1 experimental results for TSP instances 



instance 


optimum 


algorithm 


best 


mean 


s.t. 


error 


gr96.tsp 


512.3094 


STA 


525.0124 


544.9243 


13.2005 


2.48% 






DSTA 


515.5143 


549.9838 


17.1709 


0.63 % 


kroAlOO.tsp 


2.1285e4 


STA 


2.2214e4 


2.2970e4 


515.9307 


4.37% 






DSTA 


2.1910c4 


2.2740c4 


700.3377 


2.94% 


kroClOO.tsp 


2.0751e4 


STA 


2.1570e4 


2.2525e4 


679.3821 


3.95% 






DSTA 


2.1084e4 


2.2186e4 


703.0488 


1.60% 


gr!20.tsp 


1.6665c3 


STA 


1.6826c3 


1.7269e3 


30.0698 


0.97% 






DSTA 


1.6418e3 


1.7257e3 


34.4110 


-1.48% 



As can be seen from Tabled! the biggest error is no more than 5%, and 
the DSTA gets much more better results than the STA. Especially for the 
grl20.tsp, as emphasized by the bold font, DSTA achieves the better result 
than the incumbent best result given in the TSPLIB, with the sequence of (82 
11 51 23 9 103 119 4 38 7 56 41 87 14 75 44 46 20 50 98 42 17 118 49 13 113 
69 65 68 91 58 79 100 33 52 25 108 43 107 18 19 117 31 66 85 22 81 94 86 78 
15 59 76 1 16 61 29 120 32 30 92 28 45 74 105 72 40 60 24 111 96 90 54 8 70 
116 34 26 71 47 55 6 89 112 110 48 102 101 114 106 104 36 84 35 10 99 62 37 
67 83 39 57 73 63 77 95 12 97 88 21 115 2 93 109 64 53 5 27 80 3) and length 
of route at 1.6418e3, which is illustrated in Figure |H 
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gr120 

300 r 




I ' ' 1 ' 1 ' 1 1 ' 

20 40 60 80 100 120 140 160 

x 



Fig. 4 the best route of grl20.tsp obtained by DSTA 

5 Application for boolean integer programming 

In boolean integer programming (BIP), a solution comprise a series of boolean 
values (I — {0,1} or I = { — 1,1}). Swap, shift and symmetry operators are 
also proposed for internal transformation (operators aiming to change the in- 
ternal components of a sequence), and another operator called substitute is 
designed for external transformation (operator aiming to bring new compo- 
nents into a sequence). It should be noted that I = {0,1} is the same to 
X = { — 1, 1} under such circumstances, although there exists a linear transfor- 
mation relationship between them in other studies. 

5.1 Transformation operators for boolean integer programming 

As we said previously, the same intelligent operator has the same geometrical 
property for different applications. It is not difficult to imagine that the swap, 
shift and symmetry operators for boolean integer programming have the same 
formulation as that of traveling salesman problem. Let I — {0, 1}, the illus- 
trations of internal transformation are given from Figure [S] to Figure [7] 

Now, let introduce the external transformation. 
(4) substitute transformation 

x k+ i=A s k ub (m d )x k , (12) 

where, A s k ub g M. nxn is called substitute transformation matrix, rrid is a con- 
stant integer called substitute factor to control the maximum number of po- 
sitions to be substituted. By the way, the positions are randomly created. If 
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1 


1 





1 








1 



wap(2, 6) 
► 









1 





1 


1 





1 



Fig. 5 illustration of swap transformation for BIP 






1 


1 





1 








1 



shift(2,6) 
► 



Fig. 6 illustration of shift transformation for BIP 



symmetry((2,3\ (4, 5)) 
► 






1 


1 





] 








1 








-> 





U 





1 


f) 


[ 





1 





1 


) 







] 





1 


] 








1 




<- 









Fig. 7 illustration of symmetry transformation for BIP 



md = 1, we call the substitute operator local exploitation, and if irid > 2, 
the substitute operator is regarded as global exploration in this case. Figure [5] 
gives the function of the substitute transformation vividly when = 1. 






1 


1 





1 








1 



substitute{2) 
► 









1 





1 








1 



Fig. 8 illustration of substitute transformation for BIP 



5.2 MAX- CUT instances for test 

Let G = (V,E) be an undirected graph with edge weight toy on n + 1 = \V\ 
vertices and m = \E\ edges, for each edge E E, the maximum cut problem 
(MAX-CUT) is to find a subset S of the vertex set V such that the total weight 
of the edges between S and its complementary subset S = V\S is as large as 
possible. 

(LP based MAX-CUT model) [7]: 

Considering a variable for each edge 6 E, and assuming yij to be 
1 if is in the cut, and otherwise, the MAX-CUT can be modeled as the 
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following linear programming (LP) optimization problem: 

71+ 1 

max W(y) = Yl 

s.t. y is the incidence vector of a cut. (13) 

Here the incidence vector y = {ytj} £ R m , where the m is the number of edges 
in the graph. 

Let CUT(G) denote the convex hull of the incidence vectors of cuts in G. 
Since maximizing a linear function over a set of points equals to maximizing it 
over the convex hull of this set of points, we can rewrite (ITBl to the following 

max W(y) = c T y 

s.t. y e CUT(G). (14) 

where c = G K m . 

(SDP based MAX-CUT model) [13] : 

For a bipartition (S 1 , S), with = 1 if i £ S, and yi = — 1 otherwise, the 
MAX- CUT can also be formulated as the following semidefinite programming 
optimization problem: 

max W(y) = J 53 ~ VlV ^ 

«=i i=i 

S.t. y € {-1, !}«+!. (15) 

Without loss of generality, if we fix the value of the last variable at 1, then 
the problem (|15p is equivalent to the integer quadratic optimization problem 

(V) : min |p(x) = ix T Qx - x T c : x G {-1, 1}™}, (16) 

where, Q — {Qij} is a symmetric matrix with Qij = Wij(i,j — 1,2, •• • ,n), 
and c = — (u>i( n+ i), • • • , w„( n+ i)) T . It is not difficult to find that a optimal so- 
lution x* to problem (fT6|) corresponds to a optimal solution (x*, 1) of original 
problem (fT5"|) . 

Considering that the SDP based MAX-CUT model is easier to manipulate 
for intelligent operators, it is adopted in discrete STA for simple representa- 
tion. The parameters setting is the same to that in TSP instances. Under the 
circumstance, it takes about 8 seconds for each run. In the same way, we define 
the following "error" 

optimum — best „ 

error = x 100%, 

optimum 

where, best is the best result achieved by discrete STA, optimum can be found 
in [T3]. Experimental results for MAX-CUT instances are given in Tabled) 

As can be seen from Table [H the discrete STA has the ability to achieve 
the global minimum for all of the instances, and the difference between simple 
STA and DSTA is small in this case. 
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Table 2 experimental results for MAX-CUT instances 



instance 


optimum 


algorithm 


best 


mean 


s.t. 


error 


gr96 


105328 


STA 


105328 


105051 


567 









DSTA 


105328 


104982 


614 





kroAlOO 


5897392 


STA 


5897392 


5864276 


46302 









DSTA 


5897392 


5887930 


29122 





kroClOO 


5890760 


STA 


5890760 


5860540 


34277 









DSTA 


5890760 


5870613 


31573 





grl20 


2156667 


STA 


2156667 


2150378 


7132 









DSTA 


2156667 


2150378 


7132 






6 Application for discrete value selection 

Typically, the model of discrete value selection (DVS) problem is different from 
the model in ([1]), because the domain is defined as follows 

Xi eU = {ui, ■ ■ ■ ,u m },Uj e R,j = 1, •■• , m. (17) 

By introducing a linear transformation 

K 

Xi = ^2u j y ij , (18) 
i=i 

where 

K 

5>« = i,v«e {o,i}, (19) 

3=1 

then the discrete value selection can be rewritten to the equivalent constrained 
boolean integer programming problem |16] . 

In discrete STA, we only use the index of Uj to represent a solution, for 
example, a solution (1,3,2) is corresponding to {ui,u$,uz), which is easy to 
be manipulated by the intelligent operators. 



6.1 Transformation operators for discrete value selection 

The intelligent operators swap, shift, symmetry and substitute for discrete 
value selection are similar to that in boolean programming problem. As a 
result, only illustrations of these transformations are given from Figure [5] to 
Figure HU 
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k 



] 


2 


3 


3 


1 


3 


2 


1 



wap(2, 6) 
► 



k 



1 


3 


3 


3 


1 


2 


2 


1 



Fig. 9 illustration of swap transformation for DVS 



k 



] 


2 


3 


3 


1 


3 


2 


1 



shift(2^) 
► 



1 


3 


3 


1 


3 


2 


2 


1 



Fig. 10 illustration of shift transformation for DVS 
I 



synmetry{{ % 3 ),. ( 4, 5) ) 



1 


2 


3 


3 


] 


3 


2 


1 


► 


1 


1 


3 


3 


2 


3 


2 


1 




\4 >\< ->\ 








<--►;< ► 





I I 

Fig. 11 illustration of symmetry transformation for DVS 



k 



] 


2 


3 


3 


1 


3 


2 


1 



sijbstitute(2) 
► 



k 



1 


1 


3 


3 


1 


3 


2 


1 



Fig. 12 illustration of substitute transformation for DVS 



6.2 The integer Rosenbrock function for test 

The continuous Rosenbrock function has been widely studied as a bench- 
mark. Considering that the Rosenbrock function also has many integer local 
minima, in this paper, it is the first time to use it as a benchmark for integer 
programming problem. The integer Rosenbrock function is defined as 

n-l 

f rosenbrock = ^[100(:r l+ i - xf f + ( Xi - 1) 2 ],X; G {-2,-1,0,1,2}. 

i=l 

It is not difficult to find that x = (xi, • • • , x n ) for any xi = 0, i = 1, • • • , n is 
a local minimum of the integer Rosenbrock function. 

The parameters setting is the same to that of MAX-CUT instances except 
the maximum number of iterations, which is specified as 10, 20, 100, 200, 500, 
2000 for n = 5,10,20,50,100,200, respectively. Under the circumstance, 0.1, 
0.15, 0.35, 0.6, 1.5, 6.0 seconds are consumed correspondingly. Experimental 
results for integer Rosenbrock function are given in Table El 

As can be seen from Table [31 both the simple STA and the dynamic STA 
do well in the low dimensional cases. For large scale problem, the simple STA 
begins to have poor performance and even fails to find the global optimum. 
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Table 3 experimental results for integer Roscnbrock function 
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algorithm 
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mean 
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20 
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50 





STA 


















DSTA 














100 





STA 





78.4500 


46.0669 









DSTA 














200 





STA 


107 


155.4500 


42.1495 








DSTA 















On the other hand, the dynamic STA is more robust and adaptable, which 
verifies the Theorem 2 that dynamic STA has higher probability to gain global 
optimum than the simple STA. 



7 Conclusion 

In this paper, a new intelligent optimization algorithm named discrete state 
transition algorithm is studied for integer programming problem. It is the first 
time to build the framework for discrete state transition algorithm, and five 
key elements are discussed to better develop the algorithm. The representa- 
tion of a feasible solution and the dynamic adjustment strategy are mainly 
studied. Various applications have shown the adaptability of the designed in- 
telligent operators and experimental results have testified the effectiveness and 
efficiency of the proposed algorithm and strategies. 
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